How to Annotate Linguistic Information in FILES and SCAT

نویسنده

  • Rodolfo Delmonte
چکیده

We present a suite of applications used for the Italian Treebank which share their linguistic processor and end up finally in higher level annotation tool called “FILES”. The first application “FILES” – Fully Integrated Linguistic Environment for Syntactic and Functional Annotation is a prototype for a fully integrated linguistic environment for syntactic functional annotation of corpora. It takes as input tagged and disambiguated tokenized texts, a file containing the same text split into sentences, and a files containing the morphosyntactic and semantic features associated to each tagged token. Tokens may be aither single words, polywords, abbreviations or punctuation marks. An as yet separated module of “FILES” is the syntactic constituency annotation environment which uses a shallow parser on the same tagged files and produces a fully bracketed output where each sentence is a record. Files contaning bracketed sentences are given as input to Syntactic Constituency Annotation Tool “SCAT” for manual verification.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Study of the Relationship between Acoustic Features of “bæle” and the Paralinguistic Information

Language users benefit from special phonetic tools in order to communicate linguistic information as well as different emotional aspects and paralinguistic information through daily conversation. Having functions in conveying semantic information to listeners, prosodic features form the essential part of linguistic behavour, manipulating  them potentially can play an important role in transmitt...

متن کامل

Bourdieu and Genette in Paratext: How Sociology Counts in Linguistic Reasoning

While Bourdieu’s theory of practice provides an ensemble of conceptual tools which analyze patterns of social life that are irreducible to the limiting view of individuals as free-acting agents, Genette’s paratextual theory offers the metalanguage necessary to account for the microcosm of paratext as a linguistic space. This study takes issue with unidirectional approaches to researching parate...

متن کامل

Inter-Annotator Agreement on a Linguistic Ontology for Spatial Language - A Case Study for GUM-Space

In this paper, we present a case study for measuring inter-annotator agreement on a linguistic ontology for spatial language, namely the spatial extension of the Generalized Upper Model. This linguistic ontology specifies semantic categories, and it is used in dialogue systems for natural language of space in the context of human-computer interaction and spatial assistance systems. Its core rep...

متن کامل

A new method for venom extraction from venomous fish, Green Scat

Scatophagus argus argus (Green Scat) is a pretty aquarium fish. Its hard spines are venomous and can cause painful injury. In this study 60 specimens of Green Scat were collected periodically from coastal waters of Boushehr (south of Iran) from May 2011 to April 2012. Anatomical features of venomous spines were investigated. Scat venom was extracted from the spines in a new manner for keeping t...

متن کامل

How To Integrate Linguistic Information In FILES And Generate Feedback For Grammar Errors

We present three applications which share some of their linguistic processor. The first application “FILES” – Fully Integrated Linguistic Environment for Syntactic and Functional Annotation is a fully integrated linguistic environment for syntactic and functional annotation of corpora currently being used for the Italian Treebank. The second application is a shallow parser – the same used in FI...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007